Summary

I investigate the use and supply of scientific R&D by the manufacturing sector using Input-Output data from 2017. Scientific R&D is primarily considered an output from establishments, and the value of scientific R&D as an intermediary input represents only 2% of the total value of scientific R&D as a product. The transportation industry, specifically aircraft and space sub-sectors, use a relatively high percentage of the use of R&D as an input, followed by the plastics and rubber product manufacturing, and petroleum and coal product manufacturing. The manufacturing sector accounts for approximately 30% of the total supply of scientific R&D (which is primarily supplied by the scientific R&D services NAICS code 541700).

Introduction

The use and supply of scientific research and development by the

The role of innovation and technology development through R&D investments has long been linked to the competitiveness and productivity of the manufacturing sector. As early as the 1950s, empirical studies demonstrated the positive contribution of R&D expenditures to total factor productivity growth in manufacturing (Griliches 1980). More recently, research has highlighted the increasing reliance on external science by manufacturers. For example, Arora et al. (2016) find that large U.S. manufacturing firms substantially increased their linkages to public science between 1975-2006, with over 50% of new manufacturing patents citing public science by 2006. At the same time, the U.S. manufacturing sector remains an important performer of domestic R&D. Data from the National Science Foundation shows that while the manufacturing share of U.S. R&D performance declined from over 60% in the 1960s to under 30% by 2010, manufacturing continues to account for over 70% of U.S. business R&D and the majority of U.S. R&D devoted to applied research and experimental development (NSF 2020). Understanding the dual role of manufacturing as both a key supplier and consumer of scientific knowledge provides insight into the sector’s innovation ecosystem.

Total Use and Supply of Scientific Research and Development

In the input-output use table, scientific research and development is identified by commodity code “541700”. It is also useful to note that there is a specific “Scientific Research and Development Services” industry, which is also industry 541700.1 As a commodity, scientific research and development is the seventh highest output commodity, and represents a large portion of total output in the country.

## # A tibble: 10 × 2
##    `Commodity Description`                                Total Commodity Outp…¹
##    <chr>                                                                   <dbl>
##  1 Owner-occupied housing                                                1553178
##  2 Other real estate                                                     1291178
##  3 Hospitals                                                             1058239
##  4 State and local government (other services)                            836043
##  5 State and local government (educational services)                      832472
##  6 Petroleum refineries                                                   781483
##  7 Scientific research and development services                           647677
##  8 Monetary authorities and depository credit intermedia…                 640251
##  9 Insurance carriers, except direct life                                 636540
## 10 Federal general government (defense)                                   599358
## # ℹ abbreviated name: ¹​`Total Commodity Output`

As an industry, however, scientific research and development services is the 34th highest output industry. In the scientific R&D industry, the value of intermediary inputs make up just under 53% of the value of total output by this industry. Fascinatingly, as a commodity, the value of scientific R&D as an intermediary input represents only 2% of total commodity output! This curious result suggests that scientific R&D is accounted for more as an output than as an input. Unsurprisingly, we see that professional, scientific, and technical services use the majority of scientific R&D. The manufacturing sector uses just over 6% of the total supply of scientific R&D as an input.

However, in terms of the output of scientific R&D, the manufacturing sector accounts for 30% of the total production of scientific R&D - the second highest total share of the supply of scientific R&D.

With this summary of the total use and supply of scientific R&D, I turn to a more in depth analysis of the use and supply of scientific R&D by the manufacturing industry in specific.

Manufacturing Sector R&D Use

Because the use of scientific R&D as an intermediary input is so much smaller than the total output of scientific R&D, I first summarize trends in the use of R&D across manufacturing sectors. The aircraft manufacturing sector makes up for the highest percent share of total R&D use, followed by propulsion units and parts for space vehicles and guided missiles. Overall, the transportation equipment manufacturing set of industries make up the largest portion of use of scientific R&D. Curiously, computer and electronic manufacturing as an industry does not use a significant percentage of scientific R&D as an input. Indeed, the largest computer and electronics manufacturing sub-sector’s use of scientific R&D are the Electromedical and electrotherapeutic apparatus; and the electricity and signal testing instruments manufacturing sub-sectors, but even these sectors only make up .1% of the total use of R&D as an input, and .01% of total industry use of intermediate inputs. As a note, the “miscellaneous professional, scientific, technical, and other services” category has activity across many more computer and electronics sectors. This activity might be describing certain design activities, given that the only other explicitly “design” focused industries/commodities are computer and specialized design services.

Plastics and rubber manufacturing as well as petroleum and coal product manufacturing make up the other manufacturing industries that account for the total use of R&D by the manufacturing sector. The distribution of the use of R&D by manufacturing industries varies substantially, and the 3-digit NAICS sectors that see a high percent of the total use of R&D as an intermediate input are not always the ones that might be expected (e.g. food manufacturing, fabricated metal product manufacturing, over electrical equipment manufacturing).

## Warning: The following aesthetics were dropped during statistical transformation: label
## ℹ This can happen when ggplot fails to infer the correct grouping structure in
##   the data.
## ℹ Did you forget to specify a `group` aesthetic or to convert a numerical
##   variable into a factor?
## Warning: Removed 112 rows containing missing values (`geom_label_repel()`).

We also see that the use of R&D makes up for a very small portion of the total use of intermediate inputs by manufacturing industries. The industries and sectors that represent a high percent of the total use of R&D generally also see higher total R&D share of industry inputs. Notably, fabricated metal manufacturing has a higher share of total industry output relative to how much total R&D supply that sector uses.

Certain paper manufacturing industries appear to have a higher share of total industry output relative to how much total R&D supply those sectors use, and that aircraft manufacturing uses a larger total percent of R&D relative to the R&D share of industry total intermediate inputs.

In summary, the use of R&D as an input makes up a very small percentage of both the total total value of scientific R&D, as well as the use of intermediary inputs by manufacturing sub-sectors. Nonetheless, certain transportation, plastics and rubber products, petroleum , fabricated metal, printing, and paper manufacturing sub-sectors make up either a relatively high share of total R&D use, or see a relatively high share of R&D percent of total use of intermediary inputs.

Manufacturing Sector Supply of Scientific R&D

The manufacturing sector supply of scientific R&D is much more interesting. As mentioned above, the manufacturing sector accounts for roughly 30% of total scientific r&D supply (compared to the manufacturing sector’s 13% share of national GDP in 2019). However, there is substantial variation in the sectors that supply R&D, as well as in the intensity of sectoral production of R&D. The computer and electronics manufacturing sector has the highest R&D output out of the 3-digit manufacturing sectors. This sector is followed quickly by chemical manufacturing, with a slight drop off before transportation equipment manufacturing, account for around 5% of total R&D supply. Miscellaneous (specifically the surgical and medical adjacent manufacturing sub-sectors in this sector), as well as machinery manufacturing sectors follow after, after which there is another drop-off before fabricated metal, food, and electrical equipment manufacturing.

However, we also see that the chemical manufacturing sector has the sector with the greatest share of total R&D supply (pharmaceutical manufacturing), and that there is a wide distribution of supply of R&D across the computer and electronics manufacturing sector, as well as the transportation manufacturing sector. The sub sectors that contribute the most to total R&D supply in the transportation manufacturing sector also include other manufacturing of: aircraft parts, as well as guided missile and space vehicle; automobile; and propulsion units and parts for space vehicles and guided missiles.

## Warning: The following aesthetics were dropped during statistical transformation: label
## ℹ This can happen when ggplot fails to infer the correct grouping structure in
##   the data.
## ℹ Did you forget to specify a `group` aesthetic or to convert a numerical
##   variable into a factor?
## Warning: Removed 186 rows containing missing values (`geom_label_repel()`).
## Warning: ggrepel: 26 unlabeled data points (too many overlaps). Consider
## increasing max.overlaps

Each manufacturing industry contributes little to total R&D supply. This is not surprising. Because of the way that NAICS codes work, we expect that establishments with R&D output will be classified in the 500’s, rather than in manufacturing. Curiously, the industries that contribute more to the total supply of R&D are NOT always the same as the industries that have a higher share of total industry supply. Certain manufacturing sectors such as electronic computer manufacturing produce R&D valued at more than 50% of total industry supply, and 18 sub-sectors across the chemical, computer & electronics, medical, and transportation manufacturing sectors produce scientific R&D valued at more than 10% of their total industry output.

## Warning: The following aesthetics were dropped during statistical transformation: label
## ℹ This can happen when ggplot fails to infer the correct grouping structure in
##   the data.
## ℹ Did you forget to specify a `group` aesthetic or to convert a numerical
##   variable into a factor?
## Warning: Removed 206 rows containing missing values (`geom_label_repel()`).
## Warning: ggrepel: 8 unlabeled data points (too many overlaps). Consider
## increasing max.overlaps

Manufacturing sub-sector percent of total scientific research and development supply is not always correlated with the “intensity” of scientific R&D in that sub-sector. The intensity of scientific R&D activity is the value of scientific R&D as an output as a percent of total sub-sector output. For example, pharmaceutical manufacturing accounts for almost 5% of total R&D supply, far more than any other manufacturing sub-sector accounts for, and yet R&D output accounts for less than 20% of the value of total sub-sector outputs. Similarly, semiconductor and related device manufacturing accounts for a relatively higher share of total R&D supply, while electronic computer manufacturing accounts for a relatively low share of total R&D supply given the intensity of R&D output in that sector (more than 50% of the value of total sub-sector output).

## Warning: Removed 204 rows containing missing values (`geom_label_repel()`).

Input Use Clusters Production of Scientific R&D

*RETURN TO THIS - DOESN’T MEAN ANYTHING WITHOUT A QUALITATIVE DESCRIPTION OF CLUSTER ASSIGNMENTS

The Distribution of R&D Activity Across Geographies

Manufacturing activity is distributed unevenly across geographies in the United States. The above discussions illustrate sectoral level variation (meaningful at the 6-digit level) in R&D activity by manufacturing sub-sectors, and provide three measures of R&D activity: 1) the use of R&D as an input by manufacturing sub-sector, 2) the supply of scientific R&D as a percent of total R&D supply, and 3) scientific R&D output share of total sub-sector output value. While the use of scientific R&D as an input might be interesting, especially in the case of highlighting how certain sectors such as petroleum refineries, other plastic products, aircraft, and space propulsion manufacturing operate differently than most other manufacturing sub-sectors in their use of scientific R&D as an input, I focus on R&D outputs as a measure of R&D activity.2

To accomplish this, I join the input output data to earlier created County Level Business Patterns data about the geographical distribution of manufacturing establishments.

The location of manufacturing activity varies substantially by geography, and is dominated by specific large manufacturing clusters across chemical, computer and electronic, medical, and transportation equipment manufacturing. However, how does the geographic distribution of manufacturing activity change if only R&D intensive manufacturing sub-sectors are considered?

Mapping the distribution of R&D activity by establishments across the US reveals that R&D intensive industries (as measured by the top 20 industries in terms of R&D output activity), are distributed across only 477 (or 25%) of the counties across the US. Dental laboratories have by far the most number of establishments, followed by other electronic component, pharmaceutical, and surgical and medical instrument manufacturing.

When examining the distribution of R&D intensive manufacturing by employment, aircraft and pharmaceutical manufacturing dominate the total number of employees, followed by surgical and medical instrument, and semiconductor and related device manufacturing.

R&D intensive activity is distributed across only a few states. Most prominently, the large manufacturing employment centers in the Midwest disappear when focusing only on R&D intensive manufacturing. The R&D intensive manufacturing sectors that locate in the midwest tend to be in chemical manufacturing, medical device manufacturing, or computer and electronic product manufacturing. There is substantial aircraft in Connecticut, and pharmaceutical manufacturing in New Jersey. Across the rest of the north-east, R&D intensive manufacturing activity is concentrated in pharmaceuticals, surgical and medical equipment, semiconductor, wireless component, and other electronics component manufacturing.

In the southeast, the density of manufacturing establishments is far smaller than across all manufacturing industries, which is unsurprising given the large poultry manufacturing, furniture, rug mills, and ship-building clusters in this region. In Georgia, we see Gulfstream Aerospace Corporation in aircraft manufacturing. Interestingly, there remains a high density of relatively heterogeneous manufacturing activity in Florida. We see a few regional large establishments in pharmaceuticals, broadcast and wireless communications equipment, surgical and medical instrument manufacturing. However, there are a number of much smaller establishments across a variety of computer and electronics manufacturing sub-sectors. There is an interesting regionality to Florida as well. Louisiana is notably sparse in R&D intensive manufacturing, and the manufacturing cluster around machinery manufacturing and other industries around Houston notably disappears. Austin’s semiconductor manufacturing and Dallas’ aircraft manufacturing cluster stand out in Texas. Aircraft manufacturing persists in Kansas, Southern California, and Washington, and space propulsion manufacturing in Arizona and Washington. Notably, space manufacturing does not make up for a large portion of employment, while there is a large employment presence from semiconductor manufacturing establishments in Washington. A relatively dense and distributed cluster around Boulder Colorado persists, in addition to the surgical device, pharmaceutical and semiconductor manufacturing cluster in Salt Lake City.

In Southern California, outside of the aircraft manufacturing cluster, there remains substantial activity in dental laboratories, surgical device manufacturing, a variety of computer and electronics manufacturing, and pharmaceutical manufacturing. In northern California there is a large pharmaceutical and surgical device manufacturing cluster, along with a variety of computer and electronics manufacturing sub-sectors. Every state has at least one R&D intensive manufacturing sub-sector; however, some states appear to support a higher density of R&D intensive manufacturing sub-sectors than others. Regionally, the R&D intensive manufacturing sub-sectors are not always the same sectors that dominate employment and manufacturing activity (e.g. in the Midwest), and in other regions, such as Florida, the density as well as the heterogeneity in manufacturing activity appears to persist.

Regional Variation in R&D Activity

At this point, it is worthwhile to benchmark the share of manufacturing employment in R&D intensive subsectors against total employment in the region.

R&D intensive manufacturing activity makes up for a large percentage of employment in certain counties, such the 13% of employment aircraft manufacturing accounts for in Snohomish County Washington, or the 28% of employment pharmaceutical manufacturing accounts for Barceloneta Minicipio Puerto Rico. R&D intensive manufacturing activity makes up for a relatively larger share of employment than establishments, and dental laboratories/surgical & medical instruments manufacturing across Indiana, New York, Arkansas, Virginia, Oregon, Nebraska and Louisiana make up for a relatively large share of total employment and or establishments.

STATE Level Variation in R&D Intensive Manufacturing

There is also substantial heterogeneity in the composition of R&D manufacturing activity at the state level.

At the state level, R&D intensive manufacturing establishments make up less than .3% of total establishments. While California unsurprisingly has the highest share of R&D intensive manufacturing establishments, the three states preceding it are Utah, New Hampshire, Massachusetts, and Arizona. Dental laboratories make up for a large percentage of the total R&D intensive manufacturing establishments.

R&D intensive manufacturing employment accounts for more than 2% of total employment in Washington, and aircraft manufacturing is a substantial employeer across Washington, Connecticut and Kansas. Semiconductor manufacturing in Idaho, Vermont, and Oregon also makes up for a large share of total employment.

The sectoral and regional density and diversity of R&D intensive manufacturing is also highly heterogeneous. I explore the use of two different measures of diversity/concentration: the shannon-weiner diversity index (log(p_share)*p_share) and the Herfindahl-Hirschman index (sum of p_share^2)) for both employment and establishments. Because I am primarily interested in the diversity of geograpies and industries involved in R&D intensive manufacturing, I focus primarily on the Shannon -Weiner diversity index.3Counties themselves vary substantially in the diversity of industries that they support.

At the county level, employment and establishment diversity do not appear to be correlated, with either measure of diversity/concentration, and the two different measures of diversity/concentration appear to be roughly linearly correlated (e.g. high concentration HHI counties are also less diverse Shannon Index Counties).

Across both indicies, highly concentrated counties are also the counties with the largest share of total county employment (e.g. Snohomish County, WA, Monroe County, IN; Chatam County, GA; and Washington County Oregon)

There is a less tight correlation between the county proportion of total establishments and the diversity/concentration of R&D intensive manufacturing establishments. For example, Santa Clara, CA has high diversity and low concentration, as well as a high percentage share of total county establishments. Middlsex County, MA; as well as Orange County, CA; Boulder County, CO, Alameda County CA, and Hillsborough County, New Hampshire are also all relatively diverse counties with relatively high share of total establishments.

States with a larger share of manufacturing employment seem to be slightly less geographically diverse (e.g. Indiana, Wisconsin, Iowa, Alabama) than states with a lower share of manufacturing employment.

However, the geographical and industrial concentration of R&D activity by states varies substantially. Some states, such as Hawaii, Alaska, and Wyoming only see R&D intensive manufacturing activity in one geography and one industry (dental laboratories in the main county). Other states, such as Arkansas, West Virginia, South Dakota, and North Dakota only see R&D intensive manufacturing in 1 industry (dental laboratories), but spread across multiple geographies. Other states, such as Vermont, Rhode Island, and Delaware only see R&D intensive manufacturing in one geography, but distributed across multiple industries. For other states, these industrial and geographic diversity are roughly linearly correlated. Utah, California, and New Hampshire stand out as particularly high industry diversity states. In general, more states see diversity in industries, rather than geographies.

SIZE

I conclude my analysis by considering how R&D intensive manufacturing activity is distributed across firms of different sizes, and how this distribution compares against that for all manufacturing.

  1. Distribution of size across sectors
  2. Distribution of firm size across geographies? (Do we have the data to do this? Maybe skip)

I begin by considering the distribution of R&D intensive manufacturing activity across sectors at the national level. Unsurprisingly, dental laboratories are dominated by small establishments. Certain industries do not have any establishments with more than 500 employees (capacitor, resistor, coil transformer, and other inductor manufacturing; other communications equipment manufacturing; telephone apparatus; totalizing fluid meter; computer storage device; military armored vehicle, tank and tank component; and other guided missile and space part manufacturing). Other industries, such as guided missile and space vehicle propulsion have no establishments below 20 employees. Some industries see large concentrations of activity in establishments with between 50 and 250 employees, but overall, the distribution of R&D intensive manufacturing activity across establishments of different sizes is quite heterogeneous.

The location of scientific R&D activity

Scientific R&D is especially concentrated in southern and northern California, and has a large cluster of activity in New Mexico, as well as along the east coast and the mid-west. The location of scientific R&D establishments is slightly offset from the location of large manufacturing centers, and appears especially sparse in the southwest, the southern mid-western states (e.g Missouri), and in Texas.

## Warning: There was 1 warning in `mutate()`.
## ℹ In argument: `emp_tot = as.numeric(gsub("[[:punct:] ]+", "", EMP))`.
## Caused by warning:
## ! NAs introduced by coercion

R&D intensive manufacturing in particular, appears to co-locate with R&D intensive manufacturing in certain states, but not others. In California and across the east-coast, the density of scientific R&D and R&D intensive manufacturing establishments is very high. However, across the southesast, and into Florida, there are far fewer scientific R&D establishments. In the northern midwest, scientific R&D generally follows R&D intensive manufacturing, but is absent from the southern midwest. In texas, scientific R&D is located in Houston and Austin, while R&D intensive manufacturing is in Dallas. In northwest in Washington, R&D activity does appear to follow the R&D intensive manufacturing clusters in this region (e.g in Washington), but the semiconductor manufacturing clusters in this area appear to be slighly separate from the location of scientific R&D activity.

Some questions to think about: - What percentage of state employment/gdp occurs in R&D intensive industries? Are there any states where employment/gdp makes up a higher percent of state activity than others? - What percentage of county employment/gdp occurs in R&D intensive industries Are there any counties where employment/gdp makes up a higher percent of county activity than others?


  1. As a note, the commodity code “5419A0:All other miscellaneous professional scientific and technical services” is only 14% the total output of 541700, but intermediate inputs in this commodity are 94% the value of total outputs. Industry code 5419A0 is 34% of the total value of scientific R&D 541700, and the value of the use of intermediate outputs is 37% the total output of this industry↩︎

  2. Including a cutoff for inputs of the R&D share of industry inputs being greater than 0.1% or the industry share of total R&D input use being greater than 0.05% adds 20 industries to a list of 20 industries, for a total of 40 industries.↩︎

  3. The HHI index is a measure of competition and market concentration. Because we do not know how industries might compete or collaborate together, this measure might not make the most sense.↩︎